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The most massive galaxies and the richest clusters are believed to have 
emerged from regions with the largest enhancements of mass density 1 4 
relative to the surrounding space. Distant radio galaxies may pinpoint the 
locations of the ancestors of rich clusters, because they are massive systems 
associated with overdensities of galaxies that are bright in the Lyman- 
a line of hydrogen 5 " 7 . A powerful technique for detecting high-redshift 
galaxies is to search for the characteristic 'Lyman break' feature in the 
galaxy colour, at wavelengths just shortwards of Lya, due to absorption 
of radiation from the galaxy by the intervening galactic medium. Here we 
report multicolour imaging of the most distant candidate 7 9 protocluster, 
TN J1338-1942 at a redshift z « 4.1. We find a large number of objects 
with the characteristic colours of galaxies at that redshift, and we show 
that this excess is concentrated around the targeted dominant radio galaxy. 
Our data therefore indicate that TN J1338-1942 is indeed the most distant 
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cluster progenitor of a rich local cluster, and that galaxy clusters began 
forming when the Universe was only 10 per cent of its present age. 

There is increasing evidence that structures of galaxies existed in the early Universe, 
but the detection of protoclusters at redshifts z > 1 using conventional optical and 
X-ray techniques is difficult 10_12 . Some of us have developed an efficient method 
for pinpointing distant protoclusters. The technique is based on the hypothesis that 
the most powerful known high-redshift radio galaxies are frequently associated with 
massive forming galaxies 13-16 in protoclusters 5 . As a first step towards testing this 
hypothesis, we recently conducted a large programme with the Very Large Telescope 
(VLT) of the European Southern Observatory to search for galaxy overdensities asso- 
ciated with protoclusters around luminous high-redshift radio galaxies. Deep narrow- 
and broad-band imaging was used to locate candidate galaxies having bright Lya emis- 
sion, and follow-up spectra have confirmed that most of these candidates have similar 
redshifts to the high-redshift radio galaxies. All five targets studied with the VLT 
to sufficient depth have > 20 spectroscopically confirmed Lya and/or Ha companion 
galaxies, associated with galaxy overdensities 6 ' 7 . Their formal velocity dispersions are 
a few hundred km s _1 , but there was not enough time since the Big Bang for them 
to have become virialized. The scale sizes of the structures inferred from their spatial 
boundaries are ~ 3 — 5 Mpc. Assuming that the overdensities are due to a single 
structure, the masses derived from the observed structure sizes and overdensities are 
comparable to those of clusters of galaxies in the local Universe 7 . These observations 
led us to hypothesize that the overdensities of Lya galaxies around radio sources are 
due to the fact that they are in protoclusters. 

Galaxies that emit strong Lya comprise only a small fraction of distant galaxies, 
and are biased towards non-dusty objects and galaxies that are undergoing the most 
vigorous star formation. Only about 25% of z ~ 3 galaxies have Lya equivalent 
widths detectable by our VLT narrow-band imaging searches 17 ' 18 . If the overdensities 
of Lya galaxies are located in protoclusters, additional galaxy populations should 
be present and detectable on the basis of characteristic continuum features in the 
galaxy spectra. The most important of these features is the sharp 'Lyman break' 
blueward of Lya, caused by the absorption of the galaxy continuum radiation by 
neutral hydrogen clouds along the line of sight. Searching for Lyman-break galaxies 
is a powerful technique for finding highredshift galaxies 11 ' 19,20 . 

Because of its high spatial resolution, large field of view and excellent sensitivity, the 
Advanced Camera for Surveys 21 (ACS) on the Hubble Space Telescope is uniquely 
suited for studying the morphologies of galaxies in the protoclusters and for finding 
additional galaxies on the basis of the Lyman-break features in their spectra. We 
therefore used the ACS to observe the most distant of our VLT protoclusters, TN 
J1338-1942 (ref. 7) at z — 4.1. This is a structure with 21 spectroscopically confirmed 
Lja emitters and a rest-frame velocity dispersion of 325 km s -1 . Images were taken 
through three 'Sloan' filters-g band centred at 4,750 A, r band centred near 6,250 A 
and i band centred near 7,750 A. These filters were chosen so that their wavelength 
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responses bracketed redshifted Lya at 6,214 A and were sensitive to the Lyman-break 
feature blueward of Lya. 

A 3.4' x 3.4' field was observed, with the radio galaxy located ~ V from the image 
centre. Besides the radio galaxy, this field covered 12 of the 21 known Lya emitting 
galaxies in the candidate protocluster. All 12 objects were detected in both r band 
and i band, with i-band magnitudes ranging from 25 to 28, compared with 23.3 ±0.03 
for the radio galaxy. As illustrated in Fig. 1, these objects were either absent or 
substantially attenuated in the g band, and their g — r colours are generally consistent 
with predicted values of Lyman breaks 22 . Half of the objects are extended in i 775 , and 
three of these are resolved into two distinct knots of continuum emission, suggestive 
of merging. 

We next used the Lyman-break technique to search for a population of Lyman-break 
galaxies in the protocluster that do not emit strong Lya and would therefore have 
been undetectable in our VLT observations. Evidence for the existence of such a 
population was sought by analysing the number and spatial distribution of 'g-band 
dropout' objects-that is, objects whose colours are consistent with Lyman breaks 
in their spectra at the redshift of the protocluster. To investigate whether there is a 
statistically significant excess of such g-band dropout objects, we estimated the surface 
density and cosmic variance of g-band dropouts in a typical ACS field observed with 
the same filters and to the same depth as TN J1338-1942. We did this by cloning 23 
£?4 35 -band dropouts in 15 different pointings from the southern field of the Great 
Observatories Origins Deep Survey (GOODS) 24 . Results indicate that the number of 
g-band dropouts in our field is a factor of 2.5 times higher than the average number 
found in a random GOODS field. Taking account of the typical cosmic variance 25 in 
the distribution of z m 4 Lyman-break galaxies, this is a 3a excess on the assumption 
that the distribution function is gaussian. Further evidence that a substantial fraction 
of these g-dropout objects are Lyman-break galaxies associated with the protocluster 
is provided by the strong concentration of the g-band dropouts in a cluster-sized region 
around the radio galaxy. This is illustrated in Fig. 2. More than half of the g-band 
dropouts are located in a region of ~ V in radius (corresponding to a diameter of ~ 1 
Mpc at z = 4.1). The number of g-band dropouts in this region is a factor of 5 times 
the average number encountered in similarly sized regions that are randomly drawn 
from the GOODS survey. This is a 5a excess, indicating that the number of g-band 
dropouts in our field is anomalously high at greater than the 99% confidence level. 
The spatial non-uniformity of g-band dropout objects in our field becomes even more 
pronounced when fainter objects down to a magnitude of = 27 are included (Fig. 2). 

Are there alternative explanations for the observed excess of g-band dropout objects 
other than a population of Lyman-break galaxies at z ~ 4.1? An object with a 
Balmer break at z ~ 0.5 could also be observed as a g-band dropout object. However, 
a population of such z ~ 0.5 objects would also be present in the GOODS comparison 
sample. Although the existence of an intervening structure of Balmer-break galaxies 
at z ~ 0.5 cannot be completely ruled out, its coincidence in location with the z ~ 4.1 
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structure of Lya galaxies and the faintness and small sizes of the observed objects 
make this possibility highly unlikely. 

The spatial coincidence of the excess in g-band dropout objects with the previously de- 
tected overdensity of Lya emitters around a forming massive galaxy is strong evidence 
that we are observing a new population of Lyman-break galaxies in a protocluster. 
This would mean that TN J1338-1942, at z ~ 4.1, is indeed the most distant known 
protocluster, and that distant luminous radio galaxies pinpoint the progenitors of 
nearby rich clusters. Such protoclusters provide an opportunity to study the develop- 
ment of galaxies and clusters in the early Universe. They provide samples of different 
galaxy populations at the same distance, whose morphologies and spectral energy 
distributions could be used to disentangle the evolution and star formation history 
of different types of galaxies. The topological information that could be derived by 
mapping the shapes and sizes of such protoclusters over larger areas could answer the 
question of whether the first protoclusters in the early Universe formed in sheets or 
filaments. 
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Figure 1: Deep images of Lya-emitting protocluster galaxies. Images show galaxy 
morphologies observed through three filters: g band (left), r band (middle) and i 
band (right). Each 2.5" x 2.5" image has been smoothed by a gaussian function with 
a fullwidth at half-maximum of 1.5 pixels (0.074"). The observations were carried 
out between 8 and 12 July 2002 with the Wide Field Channel of the ACS 21 . The 
total observing time of 13 orbits was split over the broad-band filters F475W (g 
band, four orbits), F625W (r band, four orbits) and F775W (i band, five orbits), 
thereby bracketing redshifted Lya at 6,214 A. During each orbit, two 1,200-s exposures 
were made, to facilitate the removal of cosmic rays. The observations were processed 
through the ACS GTO pipeline 26 to produce registered, cosmic-ray- rejected images. 
The limiting 2a magnitudes in a 0.2-arcsec 2 aperture were 28.71 (F475W), 28.44 
(F625W) and 28.26 (F775W). Object detection and photometry were then obtained 
using SExtractor 27 . a, The clumpy radio galaxy TN J1338-1942 at z — 4.1. This is 
the brightest galaxy in the protocluster and inferred to be the dominant cluster galaxy 
in the process of formation. Because the equivalent width of Lya is large (~ 500 A), 
the r band is dominated by Lya. Arrows indicate the positions 8 of the radio core 
(C) and the northern hotspot (H). The Lya emission is elongated in the direction 
of the radio emission and the large-scale Lya halo 7 with a projected linear size of 
~ 15 kpc (assuming H = 65 km s _1 Mpc -1 , Qm — 0.3, Qm — 0.7). b, Images of 
five spectroscopically confirmed Lya emitters in the protocluster. Listed below each 
galaxy are its spectroscopic redshift 7 , the magnitude of the observed Lyman break, 
and the i-band magnitude. Two of the Lya emitters are clumpy, as expected from 
young galaxies. 

Figure 2: The spatial distribution of g-band dropout objects. Superimposed on 
the combined 3.4' x 3.4' ACS greyscale image are the locations of g-band dropout 
objects (blue circles), selected to have colours and magnitudes of (g — r) > 1.5, 
(g — r) > (r — i) + 1.1, (r — i) < 1 and % < 27. In addition, objects were required to 
have a SExtractor 27 stellarity parameter of less than 0.85 to ensure that the sample 
was not contaminated by stars. These criteria filter galaxies having Lyman breaks at 
z ~ 4, thereby providing a sample of protocluster Lyman-break galaxy candidates. 
We detected 30 g-dropout objects in the field around TN J1338-1942 with i 775 < 26, 
and 56 with i 775 < 27. The number of g-band dropout objects is anomalously large, 
and their distribution is concentrated within a circular region of ~ 1' in radius that 
includes the radio galaxy TN J1338-1942 (large green circle). Also shown are the 
positions of the spectroscopically confirmed Lya emitters (red squares). Because the 
selection criteria were optimized to detect Lyman-break galaxies, some of the Lya 
emitters did not fall into the formal sample of Lyman-break galaxies. The measured 
excess and its spatial clustering are evidence that a substantial fraction of the g-band 
dropout objects are Lyman-break galaxies associated with the protocluster. (See Fig. 
1 legend for further details about the observations and the subsequent analysis.) Scale 
bar, 1'. 
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